Optimal Prediction of Time-to-Failure from Information Revealed by Damage

We present a general scheme for predicting failure times, based on continuously updating in time the
probability of failure of the global system, conditioned on the information that the damage accumulated
up to the present time reveals about the pre-existing idiosyncratic realization of the system. Its
implementation on a simple prototype system of interacting elements with unknown random lifetimes,
undergoing irreversible damage until a global rupture occurs, shows that the most probable predicted
failure time (mode) may evolve non-monotonically with time as information is incorporated in the
prediction scheme. In addition, the mode, its standard deviation and, in fact, the full distribution of
predicted failure times exhibit sensitive dependence on the realization of the system, similarly to
"chaos" in spin glasses, providing a multi-dimensional dynamical explanation for the broad distribution
of failure times observed in many empirical situations.

PACS numbers: 05.10.Gg ; 91.30.Px; 05.10.Cc ; 45.05.+X 

Systems of connected and interacting elements often fail through a self-organizing cascade process. Predicting
the remaining lifetime of a complex structure, or the precise time of its failure, remains an unsolved problem in
all applications (engineering structures, materials, earthquakes, grids, networks, groups and so on),
notwithstanding its huge importance and overwhelming consequences. Different strategies include deterministic
modeling, stochastic one-body or many-body approaches, computational intelligence methods, and many other
classifiers and pattern recognition techniques, all of which have limitations and lack a sufficient understanding
of the underlying physical mechanisms. A major problem is that the failure of a given system is highly history- and
sample-dependent: in contrast with standard statistical physics, the problem is not to calculate an
ensemble-averaged thermodynamic property but to obtain a precise statement for each single idiosyncratic
realization. This difficulty is bypassed, for instance, in the strategy which consists in viewing material rupture
as a kind of universal critical transition [1] (see however [2]). This strategy rests on the hope, partially
supported by experiments, that a large system may behave like a typical realization with a kind of self-averaging
property. But this misses the real practical challenge, which is to detect the possible existence of flaws, either
pre-existing or self-organized, which are known to create a great variability of the lifetimes from one sample to
the next. In addition, existing methods are often mute on the limits of predictability and on the sensitivity to
various elements of the system under consideration.

Here, we analyze this prediction problem with a simple prototype of interacting elements with unknown random
lifetimes undergoing irreversible damage until a global rupture occurs, the so-called time-dependent hierarchical
fiber bundle model [3]. By obtaining the absolute best prediction scheme in a probabilistic sense, we are able to
shed new light on the above questions. Consistent with the information usually available in realistic situations,
we assume the knowledge of only the statistical properties of the constituting elements but not of their specific
realizations. We use the physics of their interaction to develop the prediction scheme. The key idea is to update
continuously with time the conditional probability for failure of the global system, conditioned on the information
revealed by the damage that has occurred until the present time. Continuously collecting information on the
ongoing damage progressively reveals key information on the pre-existing idiosyncratic realization of the system,
which can be gradually integrated into an increasingly accurate probabilistic prediction.

*Electronic address: vitting@unice.tr, somette@moho.ess.ucla.edu

Consider a hierarchical structure of elements with $N$ levels loaded with a stress $\sigma$ per element. The first
level is made of the individual elements, the second level is made of pairs of elements, the third level is made of
pairs of pairs, and so on. This defines a discrete hierarchical tree of local coordination 2 (the results below are
easy to extend to any coordination). This topology impacts the dynamics of failure in the following way. When one of
the two bundles of a given pair fails, its stress load is transferred instantaneously to the surviving bundle, such
that the load of the latter is doubled. When this bundle breaks, its load is transferred to the pair of bundles
associated with it if this second pair is still present. Otherwise, it is transferred to the pair of two pairs linked
at the next hierarchical level. Given some stress history $\sigma(t')$, $t' > 0$, an element is assumed to break at
some fixed random time, where the probability that this random time takes a specific value $t$ is specified by its
cumulative distribution function

$$F_0(t) = \int_0^t P_0(t')\,dt' = 1 - \exp\left\{-\kappa \int_0^t \left[\sigma(t')\right]^{\rho} dt'\right\}. \tag{1}$$

This amounts to considering an element failure as a conditional Poisson process with an intensity which is a function
of all the past stress history, weighted by the stress amplification exponent $\rho > 0$. Applied to material
failure, this law captures the physics of failure due to stress corrosion, to stress-assisted thermal activation and
to damage. A system of $2^N$ elements is fully specified by attributing to each element $i = 1, \dots, 2^N$, at the
beginning of its history, a fixed failure time $t_i$ drawn from the distribution (1). The failure time $t_i$ is by
definition the time at which element $i$ would have broken if the stress had stayed constant and equal to the initial
value $\sigma$. However, the elements are coupled through the hierarchical load transfer rule defined above. As a
consequence of the hierarchical structure of the load transfers occurring at each rupture, the stress applied to a
given element may increase, leading to a shortening of its lifetime. Consider a pair of bundles with lifetimes
$t_1 < t_2$. At time $t = t_1$, when the first bundle breaks down, its load is transferred to the second bundle. It
is easy to show from (1) that this leads to a reduction of
its lifetime to [3]

$$t_{12} = t_1 + \alpha\,(t_2 - t_1) < t_2, \qquad \alpha = 2^{-\rho}. \tag{2}$$

This law applies for any realization of lifetimes at all levels within the hierarchy and forms the basis for our 
derivation below. 
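As a minimal sketch of how rule (2) propagates through the hierarchy, the following Python snippet computes the lifetime of a bundle from the intrinsic lifetimes of its elements by applying (2) to pairs, then to pairs of pairs, and so on. The function names `pair_lifetime` and `bundle_lifetime`, the value $\rho = 1$ and the numerical lifetimes are illustrative assumptions, not taken from the text.

```python
from typing import Sequence


def pair_lifetime(t_a: float, t_b: float, alpha: float) -> float:
    """Rule (2): the pair first loses its weaker member at min(t_a, t_b);
    the remaining lifetime of the survivor is then shrunk by the factor alpha."""
    first, second = min(t_a, t_b), max(t_a, t_b)
    return first + alpha * (second - first)


def bundle_lifetime(lifetimes: Sequence[float], alpha: float) -> float:
    """Apply rule (2) level by level to 2**N intrinsic lifetimes."""
    n = len(lifetimes)
    if n == 0 or n & (n - 1):
        raise ValueError("the number of elements must be a power of two")
    level = list(lifetimes)
    while len(level) > 1:
        # Combine neighbouring bundles into the bundles of the next level.
        level = [pair_lifetime(level[i], level[i + 1], alpha)
                 for i in range(0, len(level), 2)]
    return level[0]


rho = 1.0                        # stress amplification exponent (illustrative)
alpha = 2.0 ** (-rho)            # reduction factor of Eq. (2)
example = [1.0, 3.0, 2.0, 6.0]   # hypothetical intrinsic lifetimes t_1..t_4

t_12 = pair_lifetime(example[0], example[1], alpha)   # 1 + 0.5 * (3 - 1) = 2.0
t_34 = pair_lifetime(example[2], example[3], alpha)   # 2 + 0.5 * (6 - 2) = 4.0
t_1234 = bundle_lifetime(example, alpha)              # 2 + 0.5 * (4 - 2) = 3.0
```

In this sketch the whole-system lifetime is a deterministic function of the intrinsic lifetimes; the prediction problem arises only because these lifetimes are unknown until damage reveals them.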

In order to mimic a real-life situation, we consider a creep experiment on our hierarchical system, such that at
time 0, a stress $\sigma$ is applied. We have no access to the specific lifetimes of the individual constituting
elements, only to their probability density function (PDF) $P_0(x)$, as in a real experiment. As time passes, damage
occurs, that is, elements break, thus revealing their initial lifetimes or combinations thereof. The situation
rapidly becomes complicated because of the interactions between the elements through the hierarchical stress-load
redistribution, as the damage spreads across the levels of the hierarchy. In a real-life experiment, the damage in a
material sample is monitored for instance by acoustic emissions, with both time and space localization. To construct
our prediction scheme, we only need to treat a system of four elements (or bundles) with a priori unknown initial
lifetimes $t_1$, $t_2$, $t_3$ and $t_4$, whose PDFs are known. In the case where each bundle reduces to an element of
level 1, the PDFs are identical and equal to $P_0(x)$, as we assume that the elements have i.i.d. lifetimes. However,
for the case of four bundles of arbitrary level $j > 0$, the PDFs of their lifetimes are a priori distinct and result
from the PDFs of the elementary elements at the first level combined with the specific history of the damage until
time $t$ undergone by each bundle, as we now explain.

Prediction in absence of revealed damage. Let $P_i(t_i)$ denote the PDF of the lifetimes of element (or bundle)
$i$, with $i = 1, \dots, 4$. If we knew $t_1$ and $t_2$, we would determine the lifetime of the pair as
$t_{(1,2)} = \mathrm{Min}[t_1, t_2] + \alpha\,(\mathrm{Max}[t_1, t_2] - \mathrm{Min}[t_1, t_2])$, according to (2)
(and similarly for the pair (3, 4)). But $t_1$ and $t_2$ are unknown, and the best we can do is to calculate the PDF
of $t_{(1,2)}$ at some given time $t$. Conditioned on the fact that no element has failed, we have

$$P_{(1,2),t}\left(t_{(1,2)}\right) = \frac{1}{\alpha} \int_t^{t_{(1,2)}} dt_1\, P_{1,t}(t_1)\, P_{2,t}\left(\left[t_{(1,2)} - (1-\alpha)\,t_1\right]/\alpha\right) + (1 \leftrightarrow 2), \tag{3}$$

where $P_{i,t}(t_i) = P_i(t_i) / \int_t^{\infty} P_i(t')\,dt'$ is the conditional PDF of element $i$, given that it
has not yet broken at time $t$. The second contribution $(1 \leftrightarrow 2)$ in (3), corresponding to $t_1 > t_2$,
is obtained from the first contribution, corresponding to $t_1 < t_2$, by exchanging the two indices 1 and 2. We check
that $P_{(1,2),t}(t_{(1,2)})$ is normalized to unity over the time interval from $t$ to $\infty$ by using the identity
$\int_t^{\infty} dt_{(1,2)} \int_t^{t_{(1,2)}} dt_1 = \int_t^{\infty} dt_1 \int_{t_1}^{\infty} dt_{(1,2)}$ and the
change of variable $t_{(1,2)} \to u = [t_{(1,2)} - (1-\alpha)\,t_1]/\alpha$. The PDF
$P_{(1,2,3,4),t}(t_{(1,2,3,4)})$ of the lifetimes $t_{(1,2,3,4)}$ of the group of four elements at time $t$,
conditioned on the fact that no element has ruptured until $t$, has the same structure as (3) with the substitutions
$1 \to (1,2)$ and $2 \to (3,4)$.
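The following sketch evaluates Eq. (3) numerically for the simplest case of two identical elements with exponential lifetimes and compares it with the closed form available in that case. The rate `lam`, the value $\alpha = 1/4$ (i.e. $\rho = 2$), the midpoint quadrature and all function names are assumptions made for illustration; the closed form used here is the density of a sum of two independent exponentials with rates $2\lambda$ and $\lambda/\alpha$, and requires $\alpha \neq 1/2$.

```python
import math


def conditional_exp_pdf(x: float, t: float, lam: float) -> float:
    """Exponential lifetime PDF conditioned on survival until time t."""
    return lam * math.exp(-lam * (x - t)) if x >= t else 0.0


def pair_pdf(T: float, t: float, lam: float, alpha: float, steps: int = 400) -> float:
    """Midpoint-rule evaluation of Eq. (3) for two identical exponential elements.
    With identical PDFs the (1 <-> 2) term equals the first, hence the factor 2."""
    if T <= t:
        return 0.0
    h = (T - t) / steps
    total = 0.0
    for k in range(steps):
        t1 = t + (k + 0.5) * h
        t2 = (T - (1.0 - alpha) * t1) / alpha   # survivor lifetime implied by T and t1
        total += conditional_exp_pdf(t1, t, lam) * conditional_exp_pdf(t2, t, lam)
    return 2.0 * total * h / alpha


def pair_pdf_closed_form(T: float, t: float, lam: float, alpha: float) -> float:
    """Closed form for the same case, valid for alpha != 1/2."""
    s = T - t
    prefactor = 2.0 * lam / (1.0 - 2.0 * alpha)
    return prefactor * (math.exp(-2.0 * lam * s) - math.exp(-lam * s / alpha))


lam, alpha, t_now = 1.0, 0.25, 0.0
dT = 0.02
grid = [t_now + (k + 0.5) * dT for k in range(600)]
# Normalization of Eq. (3) over [t, infinity), truncated where the tail is negligible.
norm = sum(pair_pdf(T, t_now, lam, alpha) for T in grid) * dT
```

The quadrature version accepts any pair of conditional PDFs once `conditional_exp_pdf` is replaced, which is what the substitution $1 \to (1,2)$, $2 \to (3,4)$ requires at the next level.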

Using the knowledge that one element failed at time $t^*$. Suppose we record the failure of element 1 at
time $t^*$, i.e., its initially unknown lifetime $t_1$ is suddenly revealed: $t_1 = t^*$. Conditioned on this
information revealed at time $t^*$, we now proceed to derive how this impacts the prediction of the lifetime of the
four elements, changing $P_{(1,2,3,4),t}(t_{(1,2,3,4)})$ into a conditional PDF
$P_{(1,2,3,4),t^*}(t^*_{(1,2,3,4)})$. Indeed, the failure of element 1 at $t^*$ immediately changes
$P_{(1,2),t}(t_{(1,2)})$ for the rupture time of the pair (1, 2) (i.e., of element 2 given that element 1 has broken)
from expression (3) to



$$P_{(1,2),t^*}\left(t^*_{(1,2)}\right) = \frac{1}{\alpha}\, P_{2,t^*}\left(\left[t^*_{(1,2)} - (1-\alpha)\,t^*\right]/\alpha\right). \tag{4}$$



Expression (4) derives from (3) by replacing $P_{1,t}(t_1)$ by $\delta(t_1 - t^*)$ to express the certain knowledge
of the failure time of element 1. It can also be interpreted as the change of the failure time of element 2 from
$t_2$ to $t^* + \alpha\,(t_2 - t^*)$ by the stress transfer from element 1 to element 2 occurring at $t^*$ (and with
the proper normalization of the distribution). The gain in prediction accuracy described below is due to the fact
that the variance of $P_{(1,2),t^*}(t^*_{(1,2)})$ given by (4) is smaller than that of $P_{(1,2),t}(t_{(1,2)})$
given by (3).
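The variance reduction can be made concrete for exponential lifetimes, as in the following sketch. It assumes two identical elements with rate `lam`; the closed forms use the standard facts that the minimum of two i.i.d. exponentials has rate $2\lambda$ and is independent of their difference. The function names and parameter values are illustrative only.

```python
import random


def variance_before_first_failure(lam: float, alpha: float) -> float:
    """Variance of the pair lifetime under Eq. (3): the time to the first failure
    (rate 2*lam) plus alpha times the residual lifetime of the survivor (rate lam)."""
    return 1.0 / (2.0 * lam) ** 2 + (alpha / lam) ** 2


def variance_after_first_failure(lam: float, alpha: float) -> float:
    """Variance under Eq. (4): only alpha times the residual lifetime is uncertain."""
    return (alpha / lam) ** 2


def sample_pair_lifetime(rng: random.Random, lam: float, alpha: float) -> float:
    """Draw both lifetimes and apply rule (2)."""
    t1, t2 = rng.expovariate(lam), rng.expovariate(lam)
    return min(t1, t2) + alpha * abs(t1 - t2)


lam, alpha = 1.0, 0.5
rng = random.Random(1)
samples = [sample_pair_lifetime(rng, lam, alpha) for _ in range(50_000)]
mean = sum(samples) / len(samples)
# Monte Carlo estimate of the variance before any failure is observed.
mc_variance = sum((x - mean) ** 2 for x in samples) / len(samples)
```

For these illustrative values the variance drops from 0.5 to 0.25 once the first failure is recorded, since the uncertainty on the time of the first failure has been removed.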

In contrast with the previous case leading to $P_{(1,2,3,4),t}(t_{(1,2,3,4)})$ when no failure has occurred yet, the
two pairs (1, 2) and (3, 4) do not play a symmetric role, and two scenarios can occur for times greater than $t^*$,
given that element 1 has broken at $t^*$. Scenario 1 is that element 2 fails first, followed by the rupture of the
second pair (3, 4). This scenario contains both the case where element 2 breaks first and then (3, 4), and the case
where element 3 (or 4) breaks first, then element 2 fails and then element 4 (or 3). The probability of this
scenario is



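$$\Pr\left[t^*_{(1,2)} < t^*_{(3,4)}\right] = \int_{t^*}^{\infty} dt^*_{(1,2)}\, P_{(1,2),t^*}\left(t^*_{(1,2)}\right) \int_{t^*_{(1,2)}}^{\infty} dt^*_{(3,4)}\, P_{(3,4),t^*}\left(t^*_{(3,4)}\right). \tag{5}$$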



In the final calculation of $\Pr[t^*_{(1,2)} < t^*_{(3,4)}]$, we must use the fact that
$P_{(1,2),t^*}(t^*_{(1,2)})$ is given by (4). Scenario 2 is that the second pair (3, 4) breaks first, followed by
the failure of element 2. This occurs with a probability

$$\Pr\left[t^*_{(1,2)} > t^*_{(3,4)}\right] = 1 - \Pr\left[t^*_{(1,2)} < t^*_{(3,4)}\right].$$
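As an illustration of the two scenario probabilities, the sketch below estimates $\Pr[t^*_{(1,2)} < t^*_{(3,4)}]$ by Monte Carlo for four elements with identical exponential lifetimes, and compares it with the closed form $(1+\alpha)/(1+2\alpha)$ that follows from Eq. (5) in this special case. It assumes that the pair (3, 4) is still intact at $t^*$; since exponential lifetimes are memoryless, all residual lifetimes measured from $t^*$ are again exponential with rate `lam`. Function names and parameter values are hypothetical.

```python
import random


def scenario1_probability_closed_form(alpha: float) -> float:
    """Pr[t*_(1,2) < t*_(3,4)] for i.i.d. exponential elements, pair (3, 4) intact."""
    return (1.0 + alpha) / (1.0 + 2.0 * alpha)


def scenario1_probability_mc(lam: float, alpha: float, n_samples: int,
                             rng: random.Random) -> float:
    """Monte Carlo estimate of the same probability; times are measured from t*."""
    hits = 0
    for _ in range(n_samples):
        e2, e3, e4 = (rng.expovariate(lam) for _ in range(3))
        t12 = alpha * e2                             # element 2 under doubled load, Eq. (4)
        t34 = min(e3, e4) + alpha * abs(e3 - e4)     # rule (2) applied to the intact pair
        hits += t12 < t34
    return hits / n_samples


alpha = 0.5   # corresponds to rho = 1 (illustrative)
p_exact = scenario1_probability_closed_form(alpha)   # 1.5 / 2.0 = 0.75
p_mc = scenario1_probability_mc(1.0, alpha, 100_000, random.Random(2))
```

Scenario 1 is the more likely one in this example because element 2 already carries a doubled load while the pair (3, 4) does not.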

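Conditioned on the fact that the rupture follows the first scenario ($t^*_{(1,2)} < t^*_{(3,4)}$), the PDF for the
failure time $t^*_{(1,2,3,4)}$ of the whole four-element system, written here with a superscript (1), is

$$P^{(1)}_{(1,2,3,4),t^*}\left(t^*_{(1,2,3,4)}\right) = \frac{1}{\alpha} \int_{t^*}^{t^*_{(1,2,3,4)}} dt^*_{(1,2)}\, P_{(1,2),t^*}\left(t^*_{(1,2)}\right) P_{(3,4),t^*}\left(\left[t^*_{(1,2,3,4)} - (1-\alpha)\,t^*_{(1,2)}\right]/\alpha\right). \tag{6}$$

Here, the PDF for the failure time of the first pair (1, 2) is changed into $P_{(1,2),t^*}(t^*_{(1,2)})$ given by
(4). We can thus rewrite (6) as

$$P^{(1)}_{(1,2,3,4),t^*}\left(t^*_{(1,2,3,4)}\right) = \frac{1}{\alpha^2} \int_{t^*}^{t^*_{(1,2,3,4)}} dt^*_{(1,2)}\, P_{2,t^*}\left(\left[t^*_{(1,2)} - (1-\alpha)\,t^*\right]/\alpha\right) P_{(3,4),t^*}\left(\left[t^*_{(1,2,3,4)} - (1-\alpha)\,t^*_{(1,2)}\right]/\alpha\right). \tag{7}$$

Conditioned on the fact that the rupture follows the second scenario ($t^*_{(1,2)} > t^*_{(3,4)}$), the PDF for the
failure time $t^*_{(1,2,3,4)}$ of the whole four-element system, written with a superscript (2), is

$$P^{(2)}_{(1,2,3,4),t^*}\left(t^*_{(1,2,3,4)}\right) = \frac{1}{\alpha} \int_{t^*}^{t^*_{(1,2,3,4)}} dt^*_{(3,4)}\, P_{(3,4),t^*}\left(t^*_{(3,4)}\right) P_{(1,2),t^*}\left(\left[t^*_{(1,2,3,4)} - (1-\alpha)\,t^*_{(3,4)}\right]/\alpha\right), \tag{8}$$

where $P_{(1,2),t^*}(t^*_{(1,2)})$ is given by (4).

Combining both scenarios yields the PDF for the failure time $t^*_{(1,2,3,4)}$ of the four-element system:

$$P_{(1,2,3,4),t^*}\left(t^*_{(1,2,3,4)}\right) = P^{(1)}_{(1,2,3,4),t^*}\left(t^*_{(1,2,3,4)}\right) + P^{(2)}_{(1,2,3,4),t^*}\left(t^*_{(1,2,3,4)}\right), \tag{9}$$

where the two terms in the r.h.s. of (9) are given respectively by (7) and (8). We verify that the PDF
$P_{(1,2,3,4),t^*}(t^*_{(1,2,3,4)})$ is normalized to unity, as
$\int_{t^*}^{\infty} P_{(1,2,3,4),t^*}(t^*_{(1,2,3,4)})\, dt^*_{(1,2,3,4)} = \Pr[t^*_{(1,2)} < t^*_{(3,4)}] + \Pr[t^*_{(1,2)} > t^*_{(3,4)}] = 1$,
since the integral of (7) gives (5), and the integral of (8) gives the complement to 1, using the identity
$\int_{t^*}^{\infty} dy \int_{t^*}^{y} dx = \int_{t^*}^{\infty} dx \int_{x}^{\infty} dy$ and a change of variable.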

Two elements are broken in the same pair (i.e. scenario 1 is fulfilled). Suppose that element 2 breaks at
some later time $t^\dagger > t^*$ before the rupture of the pair (3, 4). This rupture reveals new information which
can be exploited to improve the prediction of the rupture time of the 4-element bundle. Indeed, expression (9) is
changed into



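$$P_{(1,2,3,4),t^*,t^\dagger}\left(t^\dagger_{(1,2,3,4)}\right) = \frac{P_{(3,4),t^\dagger}\left(\left[t^\dagger_{(1,2,3,4)} - (1-\alpha)\,t^\dagger\right]/\alpha\right)}{\int_{t^\dagger}^{\infty} dx\, P_{(3,4),t^\dagger}\left(\left[x - (1-\alpha)\,t^\dagger\right]/\alpha\right)}, \qquad \text{for } t^\dagger_{(1,2,3,4)} > t^\dagger. \tag{10}$$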



This corresponds to a considerable decrease of uncertainty: first, scenario 2 is now excluded and, second, the
distribution (6) is collapsed similarly to the process leading to (4) at time $t^*$. The denominator ensures the
normalization of $P_{(1,2,3,4),t^*,t^\dagger}(t^\dagger_{(1,2,3,4)})$ over the interval $[t^\dagger, +\infty]$ and
expresses the fact that $P_{(1,2,3,4),t^*,t^\dagger}(t^\dagger_{(1,2,3,4)})$ is a distribution of failure times
conditioned on the failure time being larger than $t^\dagger$. The PDF $P_{(3,4),t^\dagger}$ contains the information
of whether element 3 or 4 (but not both) has ruptured in the meantime, according to a derivation similar to that
leading to (4) after the rupture of element 1.

Two elements are broken, one in each of the pairs (1,2) and (3,4). In this case the prediction of the rupture
time is given by expression (9), but the knowledge that an element broke in (3, 4) means that $P_{(3,4)}$ has to be
replaced by the expression (4), with a change of indices $(1,2) \to (3,4)$ in (4).

Three elements are broken with scenario 2. Suppose that the pair (3, 4) breaks at some time $t^\dagger > t^*$ before
the failure of element 2. Then, again, the prediction of the rupture time of the 4-element bundle is improved
according to



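$$P_{(1,2,3,4),t^*,t^\dagger}\left(t^\dagger_{(1,2,3,4)}\right) = \frac{P_{2,t^*,t^\dagger}\left(\left[t^\dagger_{(1,2,3,4)} - (1-\alpha)\,t^\dagger\right]/\alpha\right)}{\int_{t^\dagger}^{\infty} dx\, P_{2,t^*,t^\dagger}\left(\left[x - (1-\alpha)\,t^\dagger\right]/\alpha\right)}, \qquad \text{for } t^\dagger_{(1,2,3,4)} > t^\dagger. \tag{11}$$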



The denominator ensures the normalization of $P_{(1,2,3,4),t^*,t^\dagger}(t^\dagger_{(1,2,3,4)})$ over the interval
$[t^\dagger, +\infty]$ and expresses the fact that $P_{(1,2,3,4),t^*,t^\dagger}(t^\dagger_{(1,2,3,4)})$ is a
distribution of failure times conditioned on the failure time being larger than $t^\dagger$.

Three elements are broken with scenario 1. The prediction of the rupture time is then given by expression (10),
but the knowledge that an element broke in (3, 4) means that $P_{(3,4)}$ has to be replaced by the expression (4),
with a change of indices $(1,2) \to (3,4)$ in (4).

It is straightforward to iterate this enumeration for a system of arbitrary size $2^N$. Here, we present results
obtained for a system of 16 elements, with identical exponential distributions of lifetimes. In order to calculate
the PDF of the lifetime $t_c$ of the whole system, we decompose it into four bundles of 4 elements each, for which we
calculate the corresponding PDFs. The four PDFs of the 4-bundles in turn take the role of the $P_i$ used in the
previous calculations to give the PDF for the total bundle of four 4-bundles. It is important to stress that, even
though the lifetimes of the individual elements are i.i.d., the PDFs of the four 4-bundles remain the same only as
long as no individual element has broken, and then diverge as damage grows.
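For exponential lifetimes, the conditional distribution of the system lifetime can also be approximated by simulation instead of the analytical PDFs derived above. The sketch below is such a Monte Carlo substitute, not the scheme of the text: it relies on the assumption that, because exponential lifetimes are memoryless, the residual intrinsic lifetime of every surviving element is again exponential with rate `lam`, whatever its past load history. Elements are indexed from 0 on a binary tree, and all names, the set of broken elements and the parameter values are hypothetical.

```python
import random
from typing import Dict, List, Set


def load_multiplier(i: int, broken: Set[int], n_levels: int) -> int:
    """Load on element i in units of the initial stress: it doubles for every
    level at which the sibling bundle of i's bundle has failed entirely."""
    mult = 1
    for k in range(n_levels):
        start = ((i >> k) ^ 1) << k          # first element of the sibling bundle
        if all(j in broken for j in range(start, start + (1 << k))):
            mult *= 2
    return mult


def simulate_failure_time(residual: Dict[int, float], broken: Set[int],
                          t_now: float, rho: float) -> float:
    """Event-driven evolution until the last element breaks. residual maps each
    surviving element to its remaining lifetime at the initial stress."""
    residual, broken = dict(residual), set(broken)
    n_levels = (len(residual) + len(broken)).bit_length() - 1
    t = t_now
    while residual:
        # A load multiplier m speeds up the element's clock by m**rho, cf. Eq. (1).
        rates = {i: load_multiplier(i, broken, n_levels) ** rho for i in residual}
        nxt = min(residual, key=lambda i: residual[i] / rates[i])
        dt = residual[nxt] / rates[nxt]
        t += dt
        for i in residual:
            residual[i] -= dt * rates[i]
        del residual[nxt]
        broken.add(nxt)
    return t


def predict_failure_times(n_elements: int, broken: Set[int], t_now: float, lam: float,
                          rho: float, n_samples: int, rng: random.Random) -> List[float]:
    """Samples of the system lifetime given the set of elements broken at t_now."""
    alive = [i for i in range(n_elements) if i not in broken]
    return [simulate_failure_time({i: rng.expovariate(lam) for i in alive},
                                  broken, t_now, rho)
            for _ in range(n_samples)]


# Hypothetical state: 16 elements, elements 0 and 5 already broken at time 0.1.
samples = predict_failure_times(16, {0, 5}, 0.1, 1.0, 1.0, 300, random.Random(3))
mean_tc = sum(samples) / len(samples)
std_tc = (sum((x - mean_tc) ** 2 for x in samples) / len(samples)) ** 0.5
```

A histogram of `samples` plays the role of the predicted PDF of $t_c$ for the chosen damage state; repeating the call as more failures are recorded gives a sequence of updated predictions.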

We use these formulas to obtain Figure 1, which shows the PDFs of the lifetime of the total system for different
numbers $n$ of broken elements. As damage is revealed, the width of the distribution decreases, which means that the
uncertainty about when the system will fail decreases. At the same time, the most likely value of the lifetime of
the system first increases up to $n = 6$ broken elements, after which the damage of the system is so extensive that
a global rupture is imminent and the most likely value of $t_c$ decreases.

Figure 2 illustrates the concept of the sensitivity of the evolution of the PDF of failure times to the initial
randomness (analogous to "chaos" in spin glasses [4]) and documents two different ways by which the "trajectories"
of two PDFs can diverge: i) the modes (most probable values) move apart as a function of time; ii) the width also
exhibits sensitive dependence on the quenched randomness. Consider e.g. the PDFs represented by the continuous
and the dashed lines. Their modes were slightly different for $n = 4$ broken elements but then moved closer for
$n = 8$ broken elements. While comparable for $n = 4$, their widths have evolved very differently after $n = 8$
elements have failed. This illustrates the dependence on which sub-levels of the hierarchy have been broken.

This prediction scheme is based on iteratively incorporating the information on the unknown pre-existing
characteristics of the system that are revealed by the growing damage. It does not require a priori a complete
knowledge of the dynamics, and it opens the road to a suite of approximations for real systems, involving increasing
degrees of model sophistication in the implementation, which should be tested systematically. We expect the concept
of multidimensional dependence on initial conditions to remain a robust feature of the prediction of time-to-failure
in many systems, that is, there are several measures of the sensitivity to initial conditions in the divergence of the
trajectories of the PDFs of failure times.

FIG. 1: PDFs of the lifetime $t_c$ shown at different levels of damage for a system that initially contained 16
elements, just after the last element broke. The different curves correspond to increasing numbers $n$ of broken
elements: $n$ = ... (fat solid line), $n = 4$ (thin solid line), $n = 6$ (dashed line), $n = 8$ (dash-dotted line),
and $n = 12$ (dotted line). Inset: evolution of the corresponding lifetimes of the 16 elements, shown as bar heights.

FIG. 2: PDFs of five different systems of 16 elements with different realizations of the initial lifetimes of the
individual elements: a) $n = 4$ elements broken, b) $n = 8$ elements broken and c) $n = 12$ elements broken.



